SEQUENCE LISTING 

<110> Cox III, George Norbert 
Case, Casey Christopher 
Eisenberg, Stephen P. 
Jarvis, Eric Edward 
Spratt, Sharon Kaye 
Sangamo Biosciences, Inc. 

<120> Regulation of Endogenous Gene Expression in Cells Using 
Zinc Finger Proteins 

<130> 019496-002200US 

<140> 09/229,037 
<141> 1999-01-12 

<160> 40 

<170> PatentIn Ver. 2.0 

<210> 1 
<211> 25 
<212> PRT 

<213> Artificial Sequence 

^^^^^ . ^. ^ ^-F artificial Sequence: exemplary motif 

<223> Description of Artiticiai o^mu /^ttdn 
of C2H2 class of zinc finger proteins (ZFP) 

<220> 

<221> MOD_RES 

<222> (2) . . (3) 

<223> Xaa = any amino acid 

<220> 

<221> MOD_RES 

^223^ xli'=^any amino acid, may be present or absent 

<220> 

<221> MOD_RES 

<222> (7) . . (18) 

<223> Xaa = any amino acid 

<220> 

<221> MOD_RES 

<222> (20) . . (22) 

<223> Xaa = any amino acid 

<220> 

<221> MOD_RES 

iia':-Iny',»lno .eld, «y b, present or .bsent 

cJfL^ X.a xaa xaa Cys Xa. Xa. Xaa Xaa Xaa Xaa Xaa Xaa Xa. Xaa 

1 5 

xaa Xaa His Xaa Xaa Xaa Xaa Xaa His 
20 



p 



<210> 2 
<211> 10 
<212> DNA 

<213> Artificial Sequence 

^223^ Description of Artificial Sequence :ZFP target site 
with two overlapping D-able subsites 

<220> 

<221> modified__base 
<222> (1) . . (2) 
<223> n = g.a^c or t 

<220> 

<221> modif ied_base 
<222> (5) 

<223> n = g/a,c or t 
<220> 

<221> modif ied_base 
<222> (8) 

<223> n = g, a, c or t 
<220> 

<221> modif ied_base 

^223^ nl a,c or t; if g, then position 10 cannot be g 
or t 

<220> 

<221> modif ied_base 

<IIT> n-a or c; if g or t, then position 9 cannot be g 

<400> 2 
nngkngknnn 



<210> 3 
<211> 10 
<212> DNA 

<213> Artificial Sequence 

<223> Description of Artificial Sequence :ZFP target sit 
with three overlapping D-able subsites 

<220> 

<221> modified_base 
<222> (1) . . (2) 
<223> n = g,a,c or t 

<220> 

<221> modif ied_base 
<222> (5) 

<223> n = g/a,c or t 



) 

3 



<220> 

<221> modif ied__base 
<222> (8) 

<223> n = q,a,c or t 

<400> 3 
nngkngkngk 



<210> 4 
<211> 5 
<212> PRT 

<213> Artificial Sequence 

<220> , . ^ 

<223> Description of Artificial Sequence : linker 

<400> 4 

Asp Gly Gly Gly Ser 
1 5 



<210> 5 
<211> 5 
<212> PRT 

<213> Artificial Sequence 

<220> ^ . ^ 

<223> Description of Artificial Sequence : linker 

<400> 5 

Thr Gly Glu Lys Pro 
1 5 



<210> 6 
<211> 9 
<212> PRT 

<213> Artificial Sequence 

<220> ^ , , 

<223> Description of Artificial Sequence : linker 

<400> 6 

Leu Arg Gin Lys Asp Gly Glu Arg Pro 
^ 5 



<210> 7 
<211> 4 
<212> PRT 

<213> Artificial Sequence 

<220> , . , 

<223> Description of Artificial Sequence : linker 



<400> 7 

Gly Gly Arg Arg 
1 




4 



<210> 8 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : linker 
<400> 8 

Gly Gly Gly Gly Ser 
1 5 



<210> 9 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : linker 
<400> 9 

Gly Gly Arg Arg Gly Gly Gly Ser 
1 5 



<210> 10 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : Ixnker 
<400> 10 

Leu Arg Gin Arg Asp Gly Glu Arg Pro 
1 5 



<210> 11 
<211> 12 
<212> PRT 

<213> Artificial Sequence 

<220> . , 

<223> Description of Artificial Sequence : linker 

<400> 11 

Leu Arg Gin Lys Asp Gly Gly Gly Ser Glu Arg Pro 
1 5 10 



<210> 12 
<211> 16 
<212> PRT 

<213> Artificial Sequence 



<220> ^ . ^ 

<223> Description of Artificial Sequence : linker 



Leu°Arg'Gln Lys Asp Gly Gly Gly Ser Gly Gly Gly Ser Glu Arg Pro 

-1 ^ 10 



<210> 13 
<211> 25 
<212> DNA 

<213> Artificial Sequence 

<223> Description of Artificial Sequence :ZFP target site 
region surrounding initiation site of vascular 
endothelial growth factor (VEGF) gene contaxnxng 
two 9-base pair target sites 

<220> 

<221> protein__bind 
<222> ( 4 ) . . ( 12 ) 

<223> upstream 9-base pair ZFP VEGFl target site 
<220> 

<221> protein_bind 

^223^ downs^reai 9-base pair ZFP VEGF3a target site 

<400> 13 25 
agcggggagg atcgcggagg cttgg 

<210> 14 
<211> 298 
<212> DNA 

<213> Artificial Sequence 

^223? Description of Artificial Sequence : VEGFl ZFP 

construct targeting upstream 9-base paxr target 
site in VEGF promoter 

<220> 
<221> CDS 
<222> (2) . . (298) 
<223> VEGFl 

^ s s 1% '^i s I- ^ ill s o| ^ 
- Ill -I ir. s s s r„ III z z iii 2= 

20 2^ 

s III III K', III - c'.^ - - i - - - 

35 ^0 

s 1% III ill J'n zi o?? s rj 5i z - 



97 



145 



193 



50 55 60 



= =.^ aaa ttt act tgc ccg gag tgt ccg aag cgc ttc atg 
III III III lys Phe III cls p/o Clu Cys Pro Lys Arg Phe Met 
65 ''O 
^ aac cac ctg tec cgt cac ate aag acc cac cag aat aag aag 

Z sfr III Sis lei Ser Arg His He Lys Thr His Gin Asn Lys Lys 
8 5 



ggt gga tec 
Gly Giy Ser 



241 



289 



298 



<210> 15 
<211> 99 
<212> PRT 

<213> Artificial Sequence 
<400> 15 



Jlfpro lie pro Gly Lys Lys Lys Gin His He Cys His He Gin Gly 
1 5 10 

, cys Gly Lys Val Tyr Gly Thr Thr Ser Asn Leu Arg Arg His Leu Arg 

20 2b 

Trp His Thr Gly Glu Arg Pro Phe Met Cys Thr Trp Ser Tyr Cys Gly 

35 

Lys Arg Phe Thr Arg Ser Ser Asn Leu Gin Arg His Lys Arg Thr His 

50 

Thr Gly Glu Lys Lys Phe Ala Cys Pro Glu Cys Pro Lys Arg Phe Met 

65 ^0 
Arg ser Asp His Leu Ser Arg His He Lys Thr His Gin Asn Lys Lys 

8 5 



Gly Gly Ser 



<210> 16 
<211> 298 
<212> DNA 

<213> Artificial Sequence 

<223> Description of Artificial Sequence : VEGF3a ZFP 

conStiuct targeting downstream 9.base paxr target 
site in VEGF promoter 



<220> 
<221> CDS 
<222> (2) . . (298) 
<223> VEGF3a 



<400> 16 

g gta CCG ata cot ggc 
Val Pro He Pro Gly 
1 5 



aag aag aag cag cac 
Lys Lys Lys Gin His 
10 



ate tgc cac ate cag ggc 
He Cys His He Gin Gly 
15 



- Ill Z t\ ".^ III S S Z III 2; ^ '^l " 

20 

- T.i ?| IS IS s s s '.s cf. z 
r,' ir, 'I! z III s r„ iii ir. i z iii i- 

50 

= nrt tac ccg gag tgt ccg aag cgc ttc atg 

acc ggt gag aag aaa ttt get tgc ccg g g g 
Thr Gly Glu Lys Lys Phe Ala Cys fro ^.xu y 

65 ^0 
c.a agt gac gag ctg tea cga cat ate aag acc cac cag aac aag aag 
Arg Ser Asp Glu Leu Ser Arg His iie 



I. u 



193 



240- 



289 



298 



ggt gga tec 
O Gly Gly Ser 



"2 <210> 17 

fi <211> 99 

^ <212> PRT 

W <213> Artificial Sequence 



<400> 17 „. -rig Cys His He Gin Gly 

Val Pro He Pro Gly Lys Lys Lys Gin His He cys 

1 5 



^^r. cior qer Asp Leu Gin Arg His Leu Arg 
Cys Gly Lys Val Tyr Gly Gin Ser Ser Asp i. 

20 

Xrp His Oly Gl„ Pro Ph. Mat Cys Thr Trp S,. Ty. Cy, Gly , 

35 

50 

Th. Gly Olu Lya Lys Phe M, Cys Pro =10 cys Pro Lys .r. Ph. H,t 
65 '^^ 

ser ^s, Glu Leu Ser HI, n, Lys Thr His Glh .s„ Lys Lys . , 

8 5 

Gly Gly Ser 



<210> 18 
<211> 29 
<212> DNA 

<213> Artificial Sequence 

<220> . 1 o^^^onr-e- VEGF DNA target 

<223> Description of Artificial Sequence. Vbc.t 

site 1 recognition (top) strand 



<220> 

<221> protein__bind 

<222> {ID . . (19) 

<223> VEGF DNA ZFP target site 

<400> 18 

catgcatagc ggggaggatc gccatcgat 



<210> 19 
<211> 29 
<212> DNA 

<213> Artificial Sequence 

<223> Description of Artificial Sequence : VEGF DNA site 1 
complementary (bottom) strand 

<400> 19 

atcgatggcg atcctccccg ctatgcatg 



<210> 20 
<211> 29 
<212> DNA 

<213> Artificial Sequence 



^223^ Description of Artificial Sequence : VEGF DNA 
target site 3 recognition (top) strand 



<220> 

<221> protein_bind 
<222> (11) . . (19) 

<223> VEGF DNA ZFP target site - 
<400> 20 

catgcatatc gcggaggctt ggcatcgat 



<210> 21 
<211> 29 
<212> DNA 

<213> Artificial Sequence 

^^^^^ . ^-F AT-tificial Sequence: VEGF DNA target 

<223> Description of Artiriciai oe^uc;! 

site 3 complementary (bottom) strand 
<400> 21 

atcgatgcca agcctccgcg atatgcatg 



<210> 22 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 



<223> Description of Artificial Sequence : primer SPE7 



<400> 22 29 
gagcagaatt cggcaagaag aagcagcac 

<210> 23 
<211> 26 
<212> DNA 

<213> Artificial Sequence 

<220> ^ ^.^^ Artificial Sequence: primer SPEampl2 

<223> Description of Artiricidx oc^i 

<400> 23 26 
gtggtctaga cagctcgtca cttcgc 

<210> 24 
<211> 28 
<212> DNA 

<213> Artificial Sequence 

^223^ Description of Artificial Sequence : primer SPE 
amp 13 

<400> 24 28 
ggagccaagg ctgtggtaaa gtttacgg 



<210> 25 
<211> 26 
<212> DNA 

<213> Artificial Sequence 



. ^- ^ artificial Sequence: primer SPEampll 

<223> Description of Artiriciai o^h^ 

<400> 25 26 
ggagaagctt ggatcctcat tatccc 



<210> 26 
<211> 83 
<212> DNA 

<213> Artificial Sequence 

^223^ Description of Artificial Sequence : sequence 
ligated between Xbal and Styx sites 

^cSagacLa tcaaaaccca ccagaacaag aaagacggcg gtggcagcgg caaaaagaaa 60 

83 

cagcacatat gtcacatcca agg 

<210> 27 
<211> 39 
<212> DNA 

<213> Artificial Sequence 



10 



<220> 
<2 



23> Description of Artificial Sequence : primer GB19 



<400> 27 

gccatgccgg tacccatacc tggcaagaag aagcagcac 



39 



<210> 28 
<211> 33 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description o 



f Artificial Sequence : primer GBIO 



<400> 28 

cagatcggat ccacccttct tattctggtg ggt 



33 



<210> 29 
<211> 589 
<212> DNA 

<213> Artificial Sequence 

^<223> Description of Artificial Sequence : designed 
6-finger ZFP VEGF3a/l from Kpnl to BamHl 

<220> 
<221> CDS 
<222> (2) . . (589) 
<223> VEGF3a/l 



cL .t. =« ,,c ,a, aa, a,, ca, ca= a« c.c „c ca, 



tl Pfo P« Hv I7s 5l„ His lie CVS Hi, lie Cin Oiy 

1 5 10 

1-^^ aat aaa att tac ggc cag tec tec gac ctg cag cgt cac ctg cgc 
lis gTv Lys ?al ^yr lly Gin Ser Ser Asp Leu Gin Arg His Leu Arg 
20 25 

1-na cac acc qgc gag agg act ttc atg tgt acc tgg tec tac tgt ggt 
T?p nTs ?hr lly Glu Arg Pro Phe Met Cys Thr Trp Ser Tyr Cys Gly 
35 40 

'^l Z III ill III III s| Ill HI til S| III III I- 

?| i til 17. s zi III III c| iT^ r.i z III q 

III III III III z I- III q III III r=n 17 "I 

1% III It III III III ITs 17 17 III 17 'II 'Zl Tu III HI 

100 105 



97 



145 



193 



241 



289 



337 



11 



qqc tgt ggt aaa gtt tac ggc aca acc tea aat ctg cgt cgt cac ctg 
Glv Cys Gly Lys Val Tyr Gly Thr Thr Ser Asn Leu Arg Arg His Leu 
115 120 125 

cqc tag cac acc ggc gag agg cct ttc atg tgt acc tgg tec tac tgt 
Arg Trp His Thr Gly Glu Arg Pro Phe Met Cys Thr Trp Ser Tyr Cys 
130 135 140 

qgt aaa cgc ttc acc cgt teg tea aac ctg cag cgt cac aag cgt acc 
Glv Lvs Arg Phe Thr Arg Ser Ser Asn Leu Gin Arg His Lys Arg Thr 
145 150 155 160 

cac acc ggt gag aag aaa ttt get tgc ccg gag tgt ccg aag cgc ttc 
His Thr Gly Glu Lys Lys Phe Ala Cys Pro Glu Cys Pro Lys Arg Phe 
165 I'^O 175 

atq cgt agt gac cac ctg tee cgt cac ate aag ace cac cag aat aag 
Met Arg Ser Asp His Leu Ser Arg His He Lys Thr His Gin Asn Lys 
180 185 190 

aag ggt gga tec 
Lys Gly Gly Ser 
195 



385 



433 



481 



5?9 



577 



589 



<210> 30 
<211> 196 
<212> PRT 

<213> Artificial Sequence 

Vai°Pro°ile Pro Gly Lys Lys Lys Gin His He Cys His He Gin Gly 
1 5 10 15 

Cys Gly Lys Val Tyr Gly Gin Ser Ser Asp Leu Gin Arg His Leu Arg 
20 25 30 

Trp His Thr Gly Glu Arg Pro Phe Met Cys Thr Trp Ser Tyr Cys Gly 
35 40 45 

Lys Arg Phe Thr Arg Ser Ser Asn Leu Gin Arg His Lys Arg Thr His 
50 55 50 

Thr Gly Glu Lys Lys Phe Ala Cys Pro Glu Cys Pro Lys Arg Phe Met 



65 



70 75 80 



Arc Ser Asp Glu Leu Ser Arg His He Lys Thr His Gin Asn Lys Lys 
^ 85 90 95 

Asp Gly Gly Gly Ser Gly Lys Lys Lys Gin His He Cys His He Gin 
100 105 110 

Glv Cys Gly Lys Val Tyr Gly Thr Thr Ser Asn Leu Arg Arg His Leu 
115 120 125 

Arq Trp His Thr Gly Glu Arg Pro Phe Met Cys Thr Trp Ser Tyr Cys 
130 135 140 

Glv Lys Arg Phe Thr Arg Ser Ser Asn Leu Gin Arg His Lys Arg Thr 
145 150 155 .160 




His Thr Gly Glu Lys Lys Phe Ala Cys Pro Glu Cys Pro Lys Arg Phe 
165 170 175 

Met Arg Ser Asp His Leu Ser Arg His lie Lys Thr His Gin Asn Lys 
180 185 190 



Lys Gly Gly Ser 
195 



<210> 31 
<211> 42 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : JVF9 VEGF3a/l 
target oligonucleotide 

<400> 31 

agcgagcggg gaggatcgcg gaggcttggg gcagccgggt ag 



<210> 32 
<211> 42 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : JVFIO VEGF3a/l 
target oligonucleotide complementary sequence 

<400> 32 

cgctctaccc ggctgcccca agcctccgcg atcctccccg ct 



<210> 33 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : primer JVF24 
<400> 33 

cgcggatccg cccccccgac cgatg ^ 25 



<210> 34 
<211> 62 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : downstream 
primer JVF25 

<400> 34 

ccgcaagctt acttgtcatc gtcgtccttg tagtcgctgc ccccaccgta ctcgtcaatt 60 



cc 



62 



<210> 35 
<211> 7 
<212> PRT 

<213> Simian virus 40 
<220> 

<221> PEPTIDE 
<222> (1) . . (7) 

<223> SV40 large T antigen nuclear localization sequence 
(NLS) 

<400> 35 

Pro Lys hys Lys Arg Lys Val 
1 5 



<210> 36 
<211> 61 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : segment from 

EcoRI to Kpnl containing Kozak sequence including 
initiation codon and SV40 NLS 

<400> 36 

gaattcgcta gcgccaccat ggcccccaag aagaagagga aggtgggaat ccatggggta 60 
c 61 



<210> 37 
<211> 187 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : segment from 
Kpnl to Xhol containing BamHI site, KRAB-A box 
from KOXl, FLAG epitope and Hindlll site ■ - 

<400> 37 

ggtacccggg gatcccggac actggtgacc ttcaaggatg tatttgtgga cttcaccagg 60 

gaggagtgga agctgctgga cactgctcag cagatcgtgt acagaaatgt gatgctggag 120 

aactataaga acctggtttc cttgggcagc gactacaagg acgacgatga caagtaagct 180 
tctcgag 187 



<210> 38 
<211> 277 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : inserted 
fragment from BamHI to Hindlll sites 




14 



<400> 38 

ggatccgccc ccccgaccga tgtcagcctg ggggacgagc tccacttaga cggcgaggac 60 
gtggcgatgg cgcatgccga cgcgctagac gatttcgatc tggacatgtt gggggacggg 120 



gccgacttcg agtttgagca gatgtttacc gatgcccttg gaattgacga gtacggtggg 240 
ggcagcgact acaaggacga cgatgacaag taagctt 277 

<210> 39 

<211> 118 ' 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : sequence 
replacing NLS-KRAB-FLAG with NLS-FLAG only 

<400> 39 

gaattcgcta gcgccaccat ggcccccaag aagaagagga aggtgggaat ccatggggta 60 
cccggggatg gatccggcag cgactacaag gacgacgatg acaagtaagc ttctcgag 118 

<210> 40 
<211> 204 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : insert into 
Mlul/Bglll sites of pGL3-Control to create 



<400> 40 

acgcgtaagc ttgctagcga gcggggagga tcgcggaggc ttggggcagc cgggtagagc 60 

gagcggggag gatcgcggag gcttggggca gccgggtaga gcgagcgggg aggatcgcgg 120 

aggcttgggg cagccgggta gagcgagcgg ggaggatcgc ggaggcttgg ggcagccggg 180 



gattccccgg 



ggccgggatt taccccccac 



gactccgccc cctacggcgc tctggatatg 180 



pVFRl-4x 



tagagcgctc agaagcttag atct 



204 



